Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

ICDAR 2025 Competition on FEw-Shot Text line segmentation of ancient handwritten documents (FEST)

Add code
Sep 16, 2025
Viaarxiv icon

Automated Evidence Extraction and Scoring for Corporate Climate Policy Engagement: A Multilingual RAG Approach

Add code
Sep 10, 2025
Viaarxiv icon

Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage

Add code
Sep 04, 2025
Viaarxiv icon

Improving OCR for Historical Texts of Multiple Languages

Add code
Aug 14, 2025
Viaarxiv icon

From Surface to Semantics: Semantic Structure Parsing for Table-Centric Document Analysis

Add code
Aug 14, 2025
Viaarxiv icon

DocRefine: An Intelligent Framework for Scientific Document Understanding and Content Optimization based on Multimodal Large Model Agents

Add code
Aug 09, 2025
Viaarxiv icon

DocTron-Formula: Generalized Formula Recognition in Complex and Structured Scenarios

Add code
Aug 01, 2025
Viaarxiv icon

Unsupervised Document and Template Clustering using Multimodal Embeddings

Add code
Jun 13, 2025
Viaarxiv icon

SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation

Add code
May 20, 2025
Viaarxiv icon